Compact Ancestry Labeling Schemes for Trees of Small Depth

نویسندگان

  • Pierre Fraigniaud
  • Amos Korman
چکیده

An ancestry labeling scheme labels the nodes of any tree in such a way that ancestry queries between any two nodes in a tree can be answered just by looking at their corresponding labels. The common measure to evaluate the quality of an ancestry labeling scheme is by its label size, that is the maximal number of bits stored in a label, taken over all n-node trees. The design of ancestry labeling schemes finds applications in XML search engines. In the context of these applications, even small improvements in the label size are important. In fact, the literature about this topic is interested in the exact label size rather than just its order of magnitude. As a result, following the proposal of an original scheme of size 2 logn bits, a considerable amount of work was devoted to improve the bound on the label size. The current state of the art upper bound is logn + O( √ logn) bits which is still far from the known logn + Ω(log logn) lower bound. Moreover, the hidden constant factor in the additive O( √ log n) term is large, which makes this term dominate the label size for typical current XML trees. In attempt to provide good performances for real XML data, we rely on the observation that the depth of a typical XML tree is bounded from above by a small constant. Having this in mind, we present an ancestry labeling scheme of size logn + 2 log d + O(1), for the family of trees with at most n nodes and depth at most d. In addition to our main result, we prove a result that may be of independent interest concerning the existence of a linear universal graph for the family of forests with trees of bounded depth. ∗This research is supported in part by the ANR project ALADDIN, by the INRIA project GANG, and by COST Action 295 DYNAMO.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Compact Suffix Trees Resemble PATRICIA Tries: Limiting Distribution of the Depth

Suffix trees are the most frequently used data structures in algorithms on words. In this paper, we consider the depth of a compact suffix tree, also known as the PAT tree, under some simple probabilistic assumptions. For a biased memoryless source, we prove that the limiting distribution for the depth in a PAT tree is the same as the limiting distribution for the depth in a PATRICIA trie, even...

متن کامل

An Optimal Labeling Scheme for Ancestry Queries

An ancestry labeling scheme assigns labels (bit strings) to the nodes of rooted trees such that ancestry queries between any two nodes in a tree can be answered merely by looking at their corresponding labels. The quality of an ancestry labeling scheme is measured by its label size, that is the maximal number of bits in a label of a tree node. In addition to its theoretical appeal, the design o...

متن کامل

Labeling Schemes for Bounded Degree Graphs

We investigate adjacency labeling schemes for graphs of bounded degree ∆ = O(1). In particular, we present an optimal (up to an additive constant) log n+ O(1) adjacency labeling scheme for bounded degree trees. The latter scheme is derived from a labeling scheme for bounded degree outerplanar graphs. Our results complement a similar bound recently obtained for bounded depth trees [Fraigniaud an...

متن کامل

A Simple and Optimal Ancestry Labeling Scheme for Trees

We consider ancestry labeling schemes: Given a rooted tree T , assign a binary string (label) to each node, such that given the labels of any two nodes, one can determine whether the first is an ancestor of the second in T . Recently, Fraigniaud and Korman [STOC’10] showed that such labels can be assigned using log n+O(log log n) bits per label, solving a long standing open question and matchin...

متن کامل

Optimal Distance Labeling Schemes for Trees

Labeling schemes seek to assign a short label to each node in a network, so that a function on two nodes (such as distance or adjacency) can be computed by examining their labels alone. For the particular case of trees, following a long line of research, optimal bounds (up to low order terms) were recently obtained for adjacency labeling [FOCS ’15], nearest common ancestor labeling [SODA ’14], ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/0902.3081  شماره 

صفحات  -

تاریخ انتشار 2009